Transforming Pitch in a Voice Conversion Framework

Author

  • Zeynep Inanoglu
Abstract

A subtask of voice conversion is to accurately map the pitch contour of a source speaker to that of a target speaker. So far, the most widely employed method for this mapping adjusts the pitch range of the source speaker to match the target while keeping the shape of the contour unchanged. In this project, we investigate four alternative algorithms for pitch contour mapping and compare their performance with this popular baseline method, both in an objective framework and through perceptual tests. The first two methods extend the baseline to allow more complex mappings of the pitch range, while the last two methods aim to impart an entirely new contour onto the target. We found that all four methods improve on the baseline in most cases. The amount of improvement, however, varies from method to method and depends on the nature of the training data available.
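
The abstract does not give a formula for the baseline, so the following is only a minimal sketch of one common way to adjust the pitch range while preserving the contour shape: a shift and scale of log-F0 using each speaker's mean and standard deviation. The log-domain formulation, the convention that unvoiced frames are marked with 0, and the function names `log_f0_stats` and `map_pitch_range` are assumptions made for illustration, not details taken from the paper.

```python
import math
from statistics import mean, pstdev


def log_f0_stats(f0_contour):
    """Return (mean, std) of log-F0 over voiced frames (F0 > 0)."""
    log_f0 = [math.log(f) for f in f0_contour if f > 0]
    return mean(log_f0), pstdev(log_f0)


def map_pitch_range(source_f0, source_stats, target_stats):
    """Shift and scale the source log-F0 so its range matches the target.

    The mapping is linear in log-F0, so the relative shape of the contour
    is preserved. Unvoiced frames (assumed to be marked with 0) stay 0.
    """
    src_mean, src_std = source_stats
    tgt_mean, tgt_std = target_stats
    scale = tgt_std / src_std  # assumes the source contour is not constant
    return [
        math.exp(tgt_mean + (math.log(f) - src_mean) * scale) if f > 0 else 0.0
        for f in source_f0
    ]


# Hypothetical F0 values in Hz; 0.0 marks an unvoiced frame.
source = [100.0, 0.0, 120.0, 110.0, 0.0, 95.0]
target_training = [200.0, 230.0, 0.0, 260.0, 210.0]

converted = map_pitch_range(
    source, log_f0_stats(source), log_f0_stats(target_training)
)
```

Because the mapping only rescales the contour, any rise or fall in the source appears at the same place in the converted contour. That is the limitation the four alternative methods set out to address.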

Similar resources

Applying voice conversion to concatenative singing-voice synthesis

This work addresses the application of voice conversion to the singing voice. The GMM-based approach was applied to VOCALOID, a concatenative singing synthesizer, to perform singer timbre conversion. The conversion framework was applied to full-quality singing databases, achieving a satisfactory conversion effect on the synthesized utterances. We report in this paper the results of our experimentatio...

Parametric Speech Coding Framework for Voice Conversion Based on Mixed Excitation Model

Adaptation of mixed-excitation linear predictive (MELP) model for application in voice conversion is presented. The adapted model features only numerical parameters which can be used for phonetic space transformation from source to target speaker using methods of machine learning. The validity of the model was demonstrated by applying transformation to both the pitch and the spectral envelope o...

Non-linear Pitch Modification in Voice Conversion Using Artificial Neural Networks

The majority of current voice conversion methods do not focus on modelling local variations of the pitch contour, but only on linear modification of the pitch values, based on means and standard deviations. However, a significant amount of speaker-related information is also present in the pitch contour. In this paper we propose a non-linear pitch modification method for mapping the pitch contours ...

Speech Analysis – Synthesis Based on the Ptdft for Voice Conversion

The voice conversion problem has become very popular worldwide. It has applications in many fields, for example in systems that make use of prerecorded speech, such as voice mailboxes or text-to-speech synthesizers based on acoustic unit concatenation. In such cases, voice modification would be a simple and efficient way to create a desired variety of voices while avoiding recording of different spe...

Voice Conversion using Convolutional Neural Networks

The human auditory system is able to distinguish the vocal source of thousands of speakers, yet not much is known about what features the auditory system uses to do this. Fourier Transforms are capable of capturing the pitch and harmonic structure of the speaker but this alone proves insufficient at identifying speakers uniquely. The remaining structure, often referred to as timbre, is critical...

Publication date: 2003